Macroscopically local correlations can violate information causality 
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Although quantum mechanics is a very successful theory, its foundations are still a subject of intense debate. 
One of the main problems is the fact that quantum mechanics is based on abstract mathematical axioms, rather 
than on physical principles. Quantum information theory has recently provided new ideas from which one 
could obtain physical axioms constraining the resulting statistics one can obtain in experiments. Information 
causality and macroscopic locality are two principles recently proposed to solve this problem. However none 
of them were proven to define the set of correlations one can observe. In this paper, we present an extension 
of information causality and study its consequences. It is shown that the two above-mentioned principles are 
inequivalent: if the correlations allowed by Nature were the ones satisfying macroscopic locality, information 
causality would be violated. This gives more confidence in information causality as a physical principle defining 
the possible correlation allowed by Nature. 



Introduction. — Despite quantum theory being almost one 
century old, its axioms remain mathematical in nature, and 
we are still searching for physical principles from which to 
derive it. After the initial discussions of the founding fathers, 
the operationalist viewpoint prevailed, and most were satis- 
fied with a useful and strikingly precise theory. With the ad- 
vent of quantum information science, however, a new source 
from which to draw principles of a more intuitive flavor was 
unveiled. Following this direction, many recently works have 
been devoted to study which tasks would be possible if cor- 
relations more general than the ones allowed by quantum me- 
chanics could be achieved GJQ. 

There are several motivations for these studies. First, they 
allow us to understand what is special about quantum mechan- 
ics, and consequently to get more intuition on it. Following 
this direction, results like the impossibility of instantaneous 
transmission of information [ 1 ] and of cloning, the existence 
of monogamy of correlation, and secrecy were shown not 
to be particular features of quantum mechanics. Second, if 
one finds a practical task limiting the correlations to be quan- 
tum, this would immediately gain the status of a physically- 
motivated defining property of quantum correlations. In these 
lines, constraints on distributed computing 0, communica- 
tion complexity |4 ] and the like, can be thought of as can- 
didate requisites from which the theory could in principle be 
derived. Finally, it could be possible that quantum mechan- 
ics is not the definitive description of nature, and more gen- 
eral correlations might be observed in the future. Thus, some 
physically motivated sets of correlations which are known to 
be bigger than the set of quantum correlations appears as can- 
didates for possible generalizations of quantum mechanics. 

A pioneering effort in this direction was that of Popescu 
and Rohrlich, who asked if the impossibility of transmitting 
information instantaneously (i.e. the no-signalling principle) 
is enough to characterize the set of quantum correlations. It 
turns out that this is not the case. There exist sets of correla- 
tions satisfying the no- signalling principle that are not achiev- 
able in quantum physics. The tantamount example of this is 



given by the Popescu-Rohrlich (PR) box (H. 

In this vein, other information theoretic flavored principles 
were proposed as candidates from which the set of physi- 
cally realizable correlations could be constrained (5HH- In 
this work, we will be concerned with two recent propos- 
als: Macroscopic Locality (ML) [5 ] and Information Causal- 
ity (IC) (6). In a nutshell, ML states that the coarse-grained 
statistics of correlation experiments should admit a local hid- 
den variable model, that is, those statistics do not violate any 
Bell's inequality (8j. IC, on the other hand, states that if Bob 
receives m bits of information from Alice, he cannot obtain 
more information about Alice's system than m bits, even if he 
shared with her some prior resource. For the m = case, IC 
simply reduces to the no-signalling principle. Formally, IC is 
derived by introducing a game in which Bob has to guess one 
of Alice's independent and random bits (chosen at random), 
and bounding the sum of mutual informations of Bob's differ- 
ent guesses with Alice's actual data by the amount of one way 
communication from Alice to Bob. 

The purpose of this work is twofold. First, we extend the 
IC game to the case in which Alice's data is composed by d- 
dimensional alphabets (dits), of which Bob has to randomly 
guess one, allowing in principle for m dits of communication 
from Alice to Bob. As in the original case, we find an extremal 
non-signalling box that perfectly solves this task, and analyze 
the effect of adding noise to this set of correlations. The sce- 
nario we find in this case proves richer than the original, which 
allows us in turn to, second, show the inequivalence between 
IC and ML. In particular, we show that there exist correlations 
that satisfy MC, but violate IC. This implies that it is very un- 
likely that ML is the defining property of possibly obtainable 
correlations, since in this case one would have to accept that 
IC is violated. 

We will start by reviewing the principle of ML, together 
with another tentative at characterizing the quantum set of 
correlations: the Semi Definite Program (SDP) hierarchy by 
Navascues, Pironio, and Acm (NPA) [9 ], which is intimately 
related with the former. We then discuss the principle of IC by 
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introducing our task, which generalizes the original one. We 
then analyze the newly found scenario and compare it to the 
previously known case, discussing the implications. In partic- 
ular, we show how, through this extension, IC comes closer 
to the quantum set than ML, thus gaining more thrust as a 
candidate for a physical axiom defining the set of quantum 
correlations. 



Macroscopic Locality and the NPA hierarchy. — Macro- 
scopic Locality | 5 ] was introduced more as a principle pro- 
ducing an alternative to quantum theory than as an axiom for 
it. Since its inception, it was known that, when considered 
together with the no-signalling principle, it does not give rise 
to the set of quantum correlations, but, instead, to a larger set. 
This set is precisely the one labeled by Qi in [9], that is, the 
first step in a hierarchy of SDPs that eventually converge to 
the quantum set. It was known already that, in some scenar- 
ios, further steps in the hierarchy are strictly contained in Qi, 
while also containing the quantum set |9|. Thus, it sufficed to 
prove the equivalence of Qi and the ML set to realize that this 
axiom would not suffice to fully determine quantum correla- 
tions. 

ML retains its interest, though, both as an example of prin- 
ciple with physical content that can be put forth as candidate 
for axiom, and as a testable requirement from which a theory 
larger than quantum arises, and which could in principle be 
used to disprove quantum physics. As we mentioned before, 
it demands that classical physics be recovered in the large par- 
ticle number limit by imposing that the coarse-grained cor- 
relations arising in an experiment involving a macroscopic 
number of particles can be modeled with local hidden vari- 
ables. This requirement applied to microscopic probabilities 
is what gives Bell local sets of correlations, which satisfy Bell 
inequalities. Less restraining, ML imposes this constraint on 
their macroscopic counterpart; all Bell local correlations thus 
satisfy ML, but the converse fails to hold. To date, ML had not 
been contrasted with other attempts at axiomatizing quantum 
correlations from physical postulates. 



Information Causality and its extension to arbitrary alpha- 
bet dimensions. — As we stated before, Information Causality 
was introduced through the use of a game with two players, 
Alice and Bob, and in which Alice is given N independent 
and random bits of which Bob has to guess one, chosen at 
random. In order to achieve this, they can share prior to the 
game any amount of physical resources they want (classical 
correlations, entangled states, or, in a hypothetical scenario, 
nonlocal post-quantum boxes), and Alice is further allowed to 
send m bits to Bob after she receives her input. We general- 
ize this task by extending the alphabet in which Alice's inputs 
range from 2 to an arbitrary number d of dimensions. Bob 
then has to guess, again using prior shared resources but now 
allowing Alice to send him m dits, one of Alice's inputs, again 
chosen at random. 



IC, in fact, imposes a bound on 
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1= ^I(x K :G\y = K), 



(1) 



K=0 



where xk are Alice's independent and random input dits, y is 
Bob's random input telling him which xk to aim for, and G 
is Bob's guess at that chunk of Alice's data. I(xk : G\y = 
K) is in turn the Shannon mutual information between Bob's 
guess and Alice's input xk, given that he aims at guessing 
that particular input. IC, stated in general, imposes that: 



I < m log 2 d. 



(2) 



While all quantum correlations satisfy this requirement, it is 
still an open question whether there are post-quantum corre- 
lations that satisfy IC ifTOl . 

We now study to what extent the IC condition defines the 
quantum set. In order to do this, we consider a protocol 
which perfectly solves the task when an extremal set of non- 
signalling correlations is used as a resource. By adding noise 
to this box, we approach the quantum set. We start by con- 
sidering the case N = 2. Alice and Bob share a box with d 
inputs on Alice's side and 2 inputs on Bob's, and d outputs 
on both sides (see Fig. [I]). We label by a (b) the output of the 
box on Alice's (Bob's) side. The message Alice sends to Bob 
is M = (a — xq) mod d, and Bob's guess is G = (b — M) 
mod d = (b — a + xq) mod d. 

The probability of Bob making a correct guess is: 

Ps = P(G = x ,y = 0)+P(G = x 1 ,y = l) = 
\[P(G = x \y = 0) + P(G = Xl \y = 1)] = 
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Yd Y} p ^ G = x ^ = j>y = °) + p (° = *i l*i =3,y = i) 

(3) 



where we used that both Alice and Bob's inputs are unbiased. 
In the case y = 0, the condition for a correct guess G = xq 
is equivalent to b — a = mod d. When y = 1, we need 
to split the cases according to x = j with j = 0, . . . , d — 1, 
and the correct guess condition G = Xj is equivalent, in each 
case, to b — a = j mod d. Putting this together, we have for 
the success probability: 



d-l 



P s = ^^2i p ( b - a = modd\x =j,y = 0) + 



3=0 



(4) 



P(b — a — j mod d\xi = j, y = 1)]. 



Consider now the case in which Alice and Bob share a box 
defined as: 

PR (ab\xy) = | Q otherwige , (5) 

with a, 6, x G {0, . . . , d — 1}, and y G {0,1}. This is a 
straightforward generalization of the PR box [1], to which 
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it reduces in the case d = 2. It is extremal in the d2dd 
non- signalling polytope, as can be verified by using the same 
counting argument as in the proof of Theorem 1 of Ref. fY2\ . 
This box clearly satisfies P(b — a = x.y mod d\xy) = 1, 
which leads to a probability of success equal to unity (which is 
the case depicted in Fig.[T]). This in turn implies 1 = 2 log 2 d, 
which violates the IC condition ([2]) since m = 1. 



xo,xi G{0,...,d-1} 

f ^ x=(xi-xo)mod d 



T 

a 



VG{0,1} 



G=(b-M)mod d 



FIG. 1 : Information Causality protocol for d-dimensional alpha- 
bets. Suppose Alice is given two dits, xq and x\, while Bob gets 
a bit y. Bob is asked to guess the value of one of Alice's dits, ac- 
cording to the value of y. In the case that Alice and Bob share the 
noiseless box {5J they can solve this problem perfectly. In order to do 
that Alice inputs (xi—xo) mod d into the box, while Bob inputs y. 
After receiving her output a, Alice sends a message M = (a — xq) 
mod d, corresponding to one dit of communication. Bob, in posses- 
sion of M, makes his guess G = (b — M) mod d = (b — a + xo) 
mod d. Given that the box behaves as x.y = (b — a) mod d, Bob 
computes the value G = [(xi — xo).y + xo] mod d, which equals 
xq if y = and x\ if y = 1. 



We 



now define an isotropic noisy box as follows: 



PR(E) =E PRo + (l-£)l 

=E PR + ^^( PR o + • • • + PRd-i), 

that is, a mixture of the generalized PR box §5§ and a box 
1, representing classical random noise. Box 1 outputs com- 
pletely random dits regardless the inputs, and can be decom- 
posed, as we did in the second line of Eq. ([6]), in terms of PRo 
plus d — 1 other extremal non- signalling boxes, denoted by 
PRj with j e {1, • • • , d - 1}, s.t. 



PRj(ab\xy) 



1/d if x.y = (b — a) + j mod d 
otherwise 



(7) 



The parameter E (0 < E < 1) quantifies the amount of noise 
in the box. This noisy correlations satisfy P(B — A = x.y 
mod d\xy) = so we have for the success probabil- 

Although we could search for the critical value of E for 
a single box to stop violating IC, it was seen already in the 
original d = 2 case that this is not optimal 0. Instead, one 
needs to consider the task illustrated in Fig. [T] for an arbitrary 
input size TV, and extend the protocol described by nesting 
many instances of it. 



For the case of isotropic noisy boxes and d = 2, it was seen 
that a suitable nesting of the protocol was enough to recover 
the quantum bound |6|. In the Appendix, we describe this 
nesting and its extension to arbitrary dimension d. This exten- 
sion proves straightforward, apart from the consideration of 
the cancellation of errors of different boxes, which now hap- 
pens less frequently due to the larger outcome alphabet. We 
find a recurrence relation for the probability of success, and 
by solving it, we get for the success probability of n boxes 
(see the Appendix for details): 



(d-l)E n + l 



(8) 



This is the probability of Bob producing a correct guess after 
the use of n boxes. Note that, given the structure of the nest- 
ing, one needs to use n = log 2 N boxes if the input consists 
of N dits. 

In order to study the violation of IC in terms of the proba- 
bility of a successful guess, we use Fano's inequality ifTTll to 
bound /, as was already discussed for arbitrary d-dimensional 
alphabets in (6) : 

I > 2 n (log 2 d - h(P s ) - (1 - Ps) log 2 (d - 1)) , (9) 

where h(p) = — p log 2 p—{l—p) log 2 (1 —p) is the binary en- 
tropy, and we set equal probabilities for the correct guessing 
of the different dits, as is the case in the protocol. Consider- 
ing then this bound for the success probability in the nested 
protocol given in Eq. (ISl), we obtain: 



I>2 r ' 



log 2 d-h 



1 + (d - l)E r > 
d 



d- 1 



(l-^)Iog 2 (d-l) 



(10) 



Consequences of the IC extension and inequivalence with 
ML. — We now turn to the consequences of the extension of IC 
we have introduced. In Fig. [2] we show the critical value of E 
for which IC ceases to be violated, as a function of the number 
of boxes used, n, and for different values of d. This value is 
obtained from ( fTO] ). For d = 2, the critical value of E asymp- 
totically approaches Tsirelson's bound E T = l/\/2 ~ 0.707, 
which defines the extent of the quantum set for isotropic cor- 
relations. Therefore, we recover the result that IC defines 
isotropic quantum correlations for d = 2, which also coin- 
cide with those satisfying ML (or, equivalently, are in Qi). 
It was shown, however, that this protocol does not define the 
entire quantum boundary even for d = 2 fTOll . 

For larger values of d, however, the situation proves richer. 
First, we notice that for sufficiently large d, the curves go be- 
low the critical value Et. This value is therefore no longer 
representative of the extent of quantum correlations. However, 
it still defines the extent of Macroscopically Local isotropic 
correlations, as we have checked for values of d up to 5 by 
solving the SDP that defines the set Qi. This thus proves the 
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FIG. 2: Critical noise level for IC violation. We plot the critical 
amount of noise for which IC ceases to be violated, as a function of 
n, and for different values of d (d = 2 circles, d = 5 squares, d = 10 
diamonds). The solid line corresponds to l/y/2, which, according to 
our numerical calculations up to d = 5, coincides with ML. 
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FIG. 3: Scenario comparison for the 2222 and dldd cases. In the 

2222 case both IC and ML stop being violated for the same amount 
of noise E — 1/ V2. For the case dldd with d > 2, the violation 
of IC can stand more noise than that of ML, which shows that ML 
correlations can violate IC. 



d 


Eic 




Eq 


2 


0.707 
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< 0.705 


0.707 
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< 0.700 
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TABLE I: Critical values of noise for increasing alphabet dimen- 
sion. Here, E\c is the critical noise for which IC ceases to be vi- 
olated using the protocol described, optimized over n. Eml is the 
corresponding value for ML correlations (Qi), and Eq is the extent 
of the quantum set, ignored apart from the d = 2 case. All values to 
10 -3 accuracy. 



correlations. This allowed us to show the inequivalence be- 
tween IC and ML. The extension we presented might prove 
useful in deciding whether IC is a good candidate as axiom 
for quantum correlations, question that still remains open. 

As a final remark, we note that the extension remains valid 
in the d —> oo case. Interestingly, through this extension, Bob 
would be able to guess (to arbitrary precision) either one of 
two arbitrary precision floating point numbers of Alice using 
just one of the extremal dldd boxes we introduced. How this 
compares as a resource to an asymptotic number of standard 
PR 2222 boxes is a question for further study. 



inequivalence of ML and IC. Table [I] summarizes the critical 
values obtained for the different sets. 

Unfortunately, using either numerical searches or plausible 
analytical guesses, we were unable to find quantum realiza- 
tions for the isotropic correlations defined by E\q for d > 2 
above the local boundary. So, contrary to the d = 2 case, we 
do not know if the IC strategy studied here reaches the quan- 
tum boundary. We sketch the situation pictorially in Fig. [3] 
where we compare the d = 2 and larger d cases. 

Finally, we stress a curious feature of the curves presented 
in Fig. [2] Differently from the d = 2 case, in which the crit- 
ical level of noise monotonously approaches Et, for larger 
values of d the curves display a minimum in the critical noise 
for a finite value of n. The position of this minimum further 
decreases with increasing d. While this effect is not yet fully 
understood, it might hint towards the non-optimality of the 
protocol considered. We leave this question for further inves- 
tigations. 

Discussion. — We have extended the principle of IC to arbi- 
trary input alphabet dimensions and studied the consequences 
of this extension. The scenario we found turned out to be 
richer than the original case, even if only considering isotropic 
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APPENDIX: NESTED PROTOCOL 

Here we discuss the nesting of the protocol described in the 
main text needed in order to tackle the task when N > 2. The 
nesting process is similar to that originally presented in (6), 
but differs in the treatment of accumulated errors. For clarity, 
suppose Alice is given N = 4 dits, xq,x\,X2, and xs, of 
which Bob has to learn one, indexed by the random bits he 
receives, yo and y\ (see Fig. [4]). Now Alice and Bob share 
three boxes. If these were the noiseless PRo boxes of Eq. ([5]), 
they could perfectly solve the task. The main idea (explained 
in details in Fig. [4} is that they can the first box to reveal the 
value of either xq or x\, the second box to X2 or x%. In the 
first case Bob would have to know a message M', while in the 
second M" . Since only one message can be transmitted, they 
use the third box to reveal to Bob either M' or M" , depending 
on which dit he wants to know. 

Clearly, this process can be repeated an arbitrary amount of 



times, with Alice and Bob sharing 2 n — 1 boxes for a task in- 
volving N = 2 n dits. Note however that only n of these boxes 
are actively used in the protocol, in the sense that the outcome 
of the remaining ones is irrelevant for the correctness of the 
guess. So now we look at how the errors accumulate in the 
different stages of the nested protocol. The joint probability 
of success for two different boxes a and j3 is given by: 

P f = p«p§ + _ p«)(i _ p§ ), (id 

where the first term corresponds to both boxes succeeding, 
and the second term arises from error compensation: of the 
(d — l) 2 "a wrong, j3 wrong" events, only d — 1 lead to a 
cancellation. We can now think of the probability of success 
for n + 1 boxes as the joint success of n boxes and a single 
box: 

p (n+l) = p (n) ps + _ p (n) )(1 _ pg) _ ^ 

This recurrence can be solved for Ps = , the proba- 

bility of success of a noisy PR box, to finally get: 

p (n) = (d-iy-M 
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xo,xi,x 2 ,x 3 G{0 d-1} 



(xi-xo)mod d 



M'=(a'-xo)mod d 



(x3-X2)mod d 

\ 



yo, yi 6(0,1} 



yo 
I 




T 

b' 



M"=(a"-x 2 )modd 



(M"-M')mod d 




yi 




b'" 



Mrfi?rrM')Ql°d _ d . . ^ G=[b , -(b"'-M'")]mod d 

FIG. 4: Nesting of the protocol for N > 2. We illustrate how to recursively employ the N — 2 protocol in order to solve the IC task for 
larger values of N. Depicted is the case of N — 4, where Bob aims to know the value of either of the four dits xo, x\ , X2, or x$. Alice inputs 
(xi — xo) mod d in the first box, and would have to send the message M' — [a — xo) mod d to make Bob to be able to know either xo or 
x\. She proceeds similarly in the second box, but now using X2 and X3. In this case the message she would send would be M" — (a 1 — X2) 
mod d. Since she can send only one message, they use a third box to make Bob able to know either M' or M" . To this end, Alice inputs 
(M" - M') mod d to the third box, and sends the message M'" = (M" - M') mod d to Bob. By inputing y x = 0, 1 to the third Box he 
can guess the value of M' or M", depending on which dit he is looking for. Having this information he uses either box 1 or box 2 to discover 
the dit he aims for. In the figure we supposed that Bob wants to know the value of either xo or x\. In this case he would input yi = in 
the third box, find the value of M' ', input yo to first box and finally guess the value of the bit he wants according to G = [b' — (b'" — M'")] 
mod d. For arbitrary N this process can be iterated straightforwardly. 



